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Many bacteria have evolved ways to interact with glycosylation 
functions of the immune system of their hosts. Streptococcus 
pyogenes [GAS (group A Streptococcus)] secretes the enzyme 
EndoS that cleaves glycans on human IgG and impairs the effector 
functions of the antibody. The ndoS gene, encoding EndoS, 
has, until now, been thought to be conserved throughout the 
serotypes. However, in the present study, we identify EndoS 2 , an 
endoglycosidase in serotype M49 GAS strains. We characterized 
EndoS 2 and the corresponding ndoS2 gene using sequencing, 
bioinformatics, phylogenetic analysis, recombinant expression 
and LC-MS analysis of glycosidic activity. This revealed that 
EndoS 2 is present exclusively, and highly conserved, in serotype 
M49 of GAS and is only 37 % identical with EndoS. EndoS 2 
showed endo-/3-N-acetylglucosaminidase activity on all N-linked 



glycans of IgG and on biantennary and sialylated glycans of AGP 
(a i -acid glycoprotein). The enzyme was found to act only on 
native IgG and AGP and to be specific for free biantennary glycans 
with or without terminal sialylation. GAS M49 expression of 
EndoS 2 was monitored in relation to carbohydrates present in the 
culture medium and was linked to the presence of sucrose. We 
conclude that EndoS 2 is a unique endoglycosidase in serotype 
M49 and differs from EndoS of other GAS strains by targeting 
both IgG and AGP. EndoS 2 expands the repertoire of GAS 
effectors that modify key glycosylated molecules of host defence. 

Key words: c^-acid glycoprotein, endo-jS-N-acetylglucosamini- 
dase, host-pathogen interaction, IgG glycosylation, Streptococ- 
cus pyogenes. 



INTRODUCTION 

Glycosylation is a common post-translational modification, and 
almost all key molecules in the immune system are glycosylated 
[1]. IgG is the most abundant antibody in serum with the capacity 
to bind and neutralize antigens, facilitate antibody-dependent 
cytotoxicity, opsonize antigens and initiate phagocytosis. IgG is 
composed of two light and two heavy chains, of which the latter 
are glycosylated with complex N-linked glycans at Asn 297 . The 
presence and structure of this glycan is of major importance for 
the interaction of the antibody with FcyRs (Fey receptors) and 
for the subsequent effector functions elicited by the antibody 
[2-A] . The glycan is present in a pocket of the two heavy chains 
of the IgG molecule, where it has been shown to be flexible and 
dynamic allowing it to influence the gly can-protein interaction 
with FcyR [5]. IgA, IgD, IgE and IgM each carry several occupied 
N- and O-linked glycosylation sites, and the study of the glycan's 



impact on the effector functions of these immunoglobulins has 
only begun [6]. 

Streptococcus pyogenes [GAS (group A Streptococcus)] is 
a leading Gram-positive bacterial pathogen exhibiting a wide 
array of immune evasion mechanisms, including interference 
with host glycosylation [7]. Every year, this bacterium causes 
over 500000 deaths due to severe infections and post-infectious 
immunological disorders: invasive infections, rheumatic fever, 
glomerulonephritis and hundreds of millions of cases of milder 
and self-limiting infections, such as pharyngitis and impetigo [8]. 
GAS is subdivided into serotypes on the basis of the antigenic 
M-protein on the bacterial surface and there are currently over 
100 serotypes described [9]. 

An endoglycosidase from S. pyogenes, EndoS, was discovered 
in serotype Ml of GAS and found to hydrolyse the N-linked 
glycan on the heavy chain of native human IgG and in this 
way modulate the binding of IgG to FcyR [10-12]. EndoS (EC 
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Abbreviations used: 2-AB, 2-aminobenzamide; ABS, Arthrobacter ureafaciens sialidase; AGP, o^-acid glycoprotein; AMF, almond meal a-fucosidase; 
BEH, bridged ethane-silicon hybrid; BKF, bovine kidney a-fucosidase; BTG, bovine testes ,8-galactosidase; CM, C-medium; CcpA, catabolite control protein 
A; FcyR, Fey receptor; FLD, fluorescence detection; GAS, group A Streptococcus; GH18, family 18 of glycoside hydrolases; HILIC, hydrophilic interaction 
liquid chromatography; HRP, horseradish peroxidase; LCA, Lens culinaris agglutinin; 4MU-GlcNAc, 4-methylumbelliferyl N-acetyl-^-D-glucosaminide; 
MWCO, molecular-mass cut-off; NAN1, neuraminidase/sialidase 1; PNGase F, peptide N-glycosidase F; r, recombinant; UHPLC, ultra-HPLC. 
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2 Patents for the use of EndoS 2 have been applied for by Genovis AB. Jonathan Sjogren, Andrew Hollands, Victor Nizet, Maria Allhorn and Mattias 
Collin are listed as inventors on the application that is pending. 

3 Present address: Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Oxford OX1 3TA, U.K. 

4 Present address: Waters Corporation, 34 Maple Street, Milford, MA 01757, U.S.A. 

The sequences of Streptococcus pyogenes ndoS2 will appear in the DDBJ, EMBL, GenBank® and GSDB Nucleotide Sequence Databases under 
accession numbers KC155346 (strain 3487-05), KC155348 (strain AP49), KC155347 (strain ACN49), KC155349 (strain AW1) and KC155350 (strain AW2). 
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3.2.1.96) belongs to GH18 (family 18 of glycoside hydrolases) 
and has endo-jS-N-acetylglucosaminidase activity (CAZy, 2012; 
http://www.cazy.org). Enzymes in the family GH18 hydrolyses 
/J-l,4-linked GlcNAc and this group of enzymes contains both 
chitinases (EC 3.2.1.14) hydrolysing the carbohydrate chitin and 
endo-yS-N-acetylglucosaminidases (EC 3.2.1.96) with described 
endoglycosidase activity on the chitobiose core of N-linked 
complex glycans (CAZy, 2012). 

EndoS is expressed in late stationary phase during streptococcal 
growth and the catalytically active glutamate residue (Glu 235 ) and 
several tryptophan residues are required for enzymatic activity 
[13]. Different from other described bacterial endoglycosidases, 
EndoS hydrolyses the N-linked glycan only on native and not 
denatured IgG [14]. Complement activation by the classical 
pathway was reduced when antibodies were treated with EndoS 
[11]. In human blood, the recombinant enzyme has been shown 
to deglycosylate IgG, and, in an opsonophagocytic killing assay, 
recombinant EndoS was shown to increase bacterial survival 
[11]. The contribution of EndoS to GAS virulence has been 
studied in a mouse model of invasive infection, and, although 
of minor importance in the wild- type Ml bacteria, it increased 
virulence of other GAS strains when heterologously expressed 
[15]. As a strategy to treat autoimmune diseases, EndoS has 
shown promise as a biotherapeutic in a number of animal 
models of autoimmunity [16-21]. For the biotechnology industry, 
the enzyme has applications both as a tool in the analysis 
of monoclonal antibodies (Genovis AB) and potentially for 
chemoenzymatic glycoengineering [22,23]. 

In the genome of GAS strain NZ131 of serotype M49, we 
have identified the gene ndoS2 encoding the enzyme EndoS 2 
[24]. ndoS2 holds 53% identity with ndoS and the proteins 
EndoS 2 and EndoS are 37% identical. The GAS strain NZ131 
is a clinical isolate from a case of acute post-streptococcal 
glomerulonephritis in New Zealand [24]. Serotype M49 belongs 
to a serotype grouping of GAS associated with skin infections 
and glomerulonephritis, group II (M2, M42, M49, M56, M57 and 
M60), rather than throat infections and rheumatic fever (Ml, M4, 
M12 and M25) that define group I [24,25]. 

In the present study, we characterize EndoS 2 using 
bioinformatics, recombinant expression and LC-MS analysis to 
study the glycosidic activity. 

MATERIALS AND METHODS 
Bacterial strains and growth 

The genome of S. pyogenes GAS strain NZ131 of serotype 
M49 has been sequenced and this strain was therefore selected 
as the reference strain in the present study [24,25]. GAS 
was propagated on blood agar, Escherichia coli strains Top 10 
(Invitrogen) and BL21 pLysS (Invitrogen) were propagated on 
lysogeny broth agar and used for cloning and recombinant 
expression. All strains used are summarized in Supplementary 
Table SI (http://www.biochemj.org/bj/455/bj4550107add.htm). 
For selection in E. coli ToplO cells, carbenicillin was used 
at lOOAtg-mr 1 and, for E. coli BL21 pLysS, 100/xg-mr 1 
carbenicillin and 34/xg-ml -1 chloramphenicol were used. 
Overnight cultures of E. coli were carried out in lysogeny 
broth at 37 °C with aeration. Genomic DNA preparation of 
GAS strain NZ131 was performed using Puregene DNA 
Purification Kit (Qiagen). Transformation was carried out 
using heat-shock at 42 °C for 30 s. Plasmid preparations 
from E. coli were performed using Plasmid Miniprep Kit I 
(Omega Bio-Tek). All primers used are listed in Supplementary 
Table S2 (http://www.biochemj.org/bj/455/bj4550107add.htm). 



Expression of EndoS 2 was studied using growth of NZ131 in 
50% CM (C-medium) [0.5% Proteose Peptone, 1.5% (w/v) 
yeast extract, 10 mM K 2 P0 4 , 0.4 mM MgS0 4 and 17 mM NaCl 
(PH7.5)]. 



Sequencing of ndoS2 

Five GAS serotype M49 strains were selected for sequencing 
of the ndoS2 gene; 3487-05, AP49, ACN49, AW1 and AW2. 
Sequencing was carried out using primers ndoS2-out-R, seq38-R, 
seq42-R, seq54-R, seql5-F, seql7-F, seq24-F and seq28-F and 
the Lightrun sequencing service of GATC Biotech (Konstanz, 
Germany). All primers used for sequencing are summarized in 
Supplementary Table S2. The sequences have been deposited 
in GenBank® with accession numbers as follows: KC155346 
(strain 3487-05), KC155348 (strain AP49), KC155347 (strain 
ACN49), KC 155349 (strain AW1), KC 155350 (strain AW2) 
(Supplementary Table S2). 



Recombinant expression of EndoS 2 

Recombinant expression of EndoS 2 in E. coli was 
established by PCR amplification of the ndoS2 gene 
from GAS NZ131 with the primers ndoS2-F-BamHI, 
5'-CTGTAA GGATCC AGGAGAAGACTG-3', and ndoS2-R- 
Xhol, 5'-GAAAC CTCGAG TCTTTGTAATCGTAGGACTT-3'. 
The ndoS2 fragment was digested with restriction enzymes 
BamHI and Xhol (restriction sequences are underlined) and 
ligated into the expression vector pGEX-5X-3 (GE Healthcare) 
using DNA ligase T4 (Thermo Fisher Scientific) creating the 
plasmid pGEX-ndoS2. The expression vector was transformed 
into E. coli ToplO chemically competent cells and screened 
with PCR using primers ndoS2-F-BamHI and ndoS2-R-XhoI. 
Positive clones were isolated and the pGEX-ndoS2 plasmid 
was purified and transformed into the E. coli expression strain 
BL21 pLysS. One recombinant clone was grown overnight at 
37 °C with antibiotics, diluted 1:20 in lysogeny broth medium 
with antibiotics and grown for 3 h to mid-exponential phase. 
The expression of the protein GST-EndoS 2 was induced with 
0. 1 mM IPTG for 3 h. The cells were harvested and lysed 
with BugBuster Protein Extraction Reagent (Novagen/Merck). 
Recombinant GST-EndoS 2 was purified on a column 
with glutathione-Sepharose 4B (GE Healthcare) and eluted with 
reduced glutathione. The GST tag was cleaved off using Factor 
Xa (New England BioLabs). Site-directed mutagenesis was 
performed on pGEX-ndoS2 using a QuikChange® II Site- 
Directed Mutagenesis Kit (Agilent) with primers ndoS2(E-L)- 
F and ndoS2(E-L)-R, to exchange the glutamate residue (E) in 
the active site to leucine (L). Recombinant EndoS 2 (E186L) was 
cloned and expressed in a similar way to EndoS 2 . 



Phylogenetic analysis 

We searched non-redundant protein databases at NCBI with 
the BLASTP algorithm, submitting the EndoS 2 sequence of 
GAS strain ACN49 (M49). We retrieved similar protein 
sequences, setting a cut-off of the expect value at <10~ 10 . 
All but two sequences belonged to GH18, except for two 
sequences of other hypothetical proteins. All sequences were 
included in phylogenetic analyses of the proteins, and the 
non-GH18 proteins served as outgroup. A total of 101 
protein sequences were aligned in Geneious version 6.0.3 
(Biomatters Ltd, available from http://www.geneious.com/) 
using the ClustalW algorithm (Supplementary Table S3 at 
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http://www.biochemj.org/bj/455/bj4550107add.htm). From the 
alignment of 1817 amino acids, a region of generally high 
coverage comprising 1099 amino acids was extracted and 
analysed in BEAST version 1.7.4 [7,26]. We used the Blosum62 
substitution model [8,27] with no site heterogeneity specification, 
set a strict molecular clock, selected the Yule process [9,28] 
for tree prior and ran Bayesian MCMC sampling every 
1000 generations for 10 million generations. The output was 
examined with Tracer version 1.5 (A. Rambaut and A.J. 
Drummond, available from http://beast.bio.ed.ac.uk/Tracer) in 
order to ensure that likelihood scores were stationary and that 
effective sample sizes were adequate (>500), setting burnin 
to 25 % . A maximum clade credibility tree was calculated 
with TreeAnnotator version 1.7.4 [1,26]. 16S rRNA (RNA or 
cDNA) sequences were retrieved from the Ribosomal Project 
Database (http://rdp.cme.msu.edu) and GenBank® for taxonomic 
analyses of the corresponding taxa/strains in the EndoS dataset 
(Supplementary Table S3). In case a specific strain was lacking 
for taxa more distantly related in the EndoS protein tree dataset, 
another strain was chosen if available. This resulted in a total 
of 51 representative sequences, which were aligned according to 
the procedures described for the EndoS protein dataset above. 
From the resulting alignment of 2172 nt, a high coverage region 
comprising 1576 positions was extracted and analysed in BEAST 
version 1.7.4 [2-A,26\ with similar parameter settings except 
for the substitution model [HKY with rate variation across 
sites following a discrete gamma distribution (G) with four 
rate categories]. Verification and tree calculation followed the 
procedures above, and the tree was rooted using the fungal 
sequences, which were a part of the dataset, as outgroup. 



EndoS 2 expression analysis 

Overnight cultures of NZ131 in CM were diluted 1:50 and grown 
for 16 h at 37 °C with 5 % C0 2 and the cysteine protease inhibitor 
E-64 at 20 /iM. Glucose, galactose, sucrose, acetylglucosamine 
and mannose were added at 0.01 % concentration when 
indicated. Supernatants were concentrated using precipitation 
with 0.3 mM TCA (trichloroacetic acid). The samples and 
0.5 fig of recombinant EndoS 2 were resuspended in SDS/PAGE 
loading buffer and loaded on to a 10% Bis-Tris gel. The 
electrophoresis was performed at 180 V for approximately 
60min and stained with PageBlue Protein Staining Solution 
(Fermentas). Blotting on to a PVDF membrane was performed 
according to the manufacturer's instructions using Trans-Blot 
Turbo (Bio-Rad Laboratories) equipment. The membrane was 
blocked in 5 % (w/v) dried skimmed milk powder (Difco) 
and incubated with 10 /xl of rabbit polyclonal anti-EndoS 2 for 
1 h at 37 °C with rotation. Washing was consistently carried 
out in PBST (PBS with 0.05% Tween 20) three times for 
10 min. The membrane was washed and incubated with 2.5 fil of 
HRP (horseradish peroxidase)-conjugated goat anti-(rabbit IgG) 
(H + L) (Bio-Rad Laboratories), washed and developed using 
Supersignal West Pico Chemiluminescent Substrate (Thermo 
Fisher Scientific). Antiserum against EndoS 2 was obtained using 
1 mg of recombinant EndoS 2 to immunize rabbits following 
standard protocols by Davids Biotechnologie, Regensburg, 
Germany. 



N-glycan hydrolysis assay 

A sample of 1 fig of recombinant EndoS 2 , EndoS 2 (E186L) or 
EndoS or 500 units of PNGase F (peptide N-glycosidase F) was 
incubated with 3 fig ofIgGor5 /xgof AGP («! -acid glycoprotein) 



(Sigma-Aldrich) in PBS at 37 °C for 2h. For PNGase F, 
the substrate was denatured according to the manufacturer's 
instructions (New England BioLabs). Human IgG subclasses 
IgG^ (Calbiochem/Merck) were incubated with recombinant 
EndoS 2 , EndoS 2 (E186L) or PBS under the reaction conditions 
described above. All reactions were separated on a 10% Bis- 
Tris gel as described above. Lectin blotting was performed on 
a PVDF membrane (Millipore). The membrane was incubated 
in lectin blot buffer (10 mM Hepes, 0.15 M NaCl, 0.1 % Tween 
20, 0.01 mM MnCl 2 and 0.1 mM CaCl 2 ) for 1 h and incubated 
with 5 fig of biotinylated LCA (Lens culinaris agglutinin) (Vector 
Laboratories) in the same buffer. The membrane was washed three 
times for 10 min in lectin blot buffer and subsequently 2.5 fig of 
HRP coupled to streptavidin (Vector Laboratories) was added for 
1 h. The membrane was developed as described above. The 16 h 
bacterial supernatants were concentrated using lOkDa MWCO 
(molecular-mass cut-off) spin columns (Pall) and a functional 
assay on the activity of the supernatants on 3 fig of human 
serum IgG was performed. The reaction mixture was incubated 
at 37 °C overnight and analysed by SDS/PAGE (10% gel) and a 
subsequent LCA lectin blot as described above. 



Chitinase assay 

4MU-GlcNAc (4-methylumbelliferyl /V-acetyl-jS-D-glucosami- 
nide) (Sigma-Aldrich) was incubated at 0.2 mM with 0.3 m- 
unit of chitinase from Streptomyces griseus (Sigma-Aldrich) 
or 2 ixg of rEndoS 2 (where r denotes recombinant) or 2 fig of 
rEndoS or PBS in 100 /A of PBS. The reactions were incubated 
at 37 °C for 1 h. Then, 100 fil of 0.1 M glycine (pH 10) was 
added to stop the reaction. Absorbance at 355/445 nm was 
measured in a black 96-well plate using a spectrophotometer. The 
experiments were carried out using five replicates and results are 
shown as means+S.D. The response in absorbance was analysed 
statistically by an unpaired Student's t test, where differences 
were considered significant if P < 0.05. ****/> < 0.001. 



Glycoprotein denaturing 

A 4 fig amount of IgG or AGP was incubated in 10 fi\ of PBS 
at 37 °C, 40 °C, 50 °C, 60 °C, 70 °C or 80 °C for 30 min. After 
the incubation, the samples were kept at 37 °C. Then, 2 fig of 
rEndoS 2 was added to each reaction mixture and incubated further 
at 37 °C for 2 h. The samples were analysed on a SDS/PAGE 
gel and for IgG with LCA lectin blotting as described above. 
For analysis of EndoS 2 specificity, 4 fig of a 2 -macroglobulin, 
ovalbumin, human lactoferrin, RNase B and fetuin (all Sigma- 
Aldrich) were incubated with 2 fig of EndoS 2 at 37 °C overnight 
and subsequently analysed on SDS/PAGE gel as described. 

LC-FLD (fluorescence detection)-MS 

Online coupled LC-MS with FLD was performed using a Waters 
Xevo G2 QTof with Acquity UPLC and BEH (bridged ethane- 
silicon hybrid) glycan column (1.0 mm x 1 50 mm, 1 .7 fim particle 
size). MS data was acquired in negative mode with the following 
conditions: 2500 V capillary voltage, 50 V cone voltage, 280 °C 
desolvation temperature, 600 l-h -1 desolvation gas and 100 °C 
source temperature. The analyser was set to sensitivity mode. 
The fluorescence data rate was 1 point-s 1 and a PMT gain of 
10 with excitation and emission wavelengths set at 320 nm and 
420 nm respectively. Samples were in 80 % acetonitrile with an 
injection volume of 10 fil. The flow rate was 0.150 /xl-min -1 . 
Solvent A was 50 mM ammonium formate (pH 4.4) and solvent 
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Figure 1 Genetic context analysis of ndoS and ndoS2 

The genetic context of ndoS2 (from NZ131/M49) and ndoS (from MGAS5005/M1) was analysed by aligning and comparing the identity of the sequences in MacVector. 



B was acetonitrile. A 40 min linear gradient was used and was as 
follows: 28^13 % solvent A for 3 1 min, 70 % solvent A for 4 min 
and 28 % solvent A for 4 min. 



Exoglycosidase digestion arrays 

Analysis of glycan sequence, composition and linkage 
specificities was facilitated by the use of exoglycosidase digestion 
arrays. All digestion reactions were performed with enzymes 
from Prozyme. Fluorescently labelled glycans were digested 
in 50 mM sodium acetate (pH5.5) at 37 °C overnight using a 
panel of enzymes with each digestion reaction brought to a final 
volume of 10 /xl using double-distilled water. Digested glycans 
were then separated from the enzyme mixtures using lOkDa 
MWCO centrifugal filters. Digested 2-AB (2-aminobenzamide)- 
labelled glycans were then prepared for separation on UHPLC 
(ultra-HPLC) with fluorescence detection using a BEH glycan 
column as described previously [5,29]. Specific non-reducing 
end monosaccharides were removed as follows: terminal 
sialic acid in all linkages was removed with 1 m-umV/A 1 
ABS (Arthrobacter ureafaciens sialidase); terminal galactose 
monosaccharides were removed using 0.5 m-unit-jctl -1 BTG 
(bovine testes /3-galactosidase), which releases both /3(1,3)- and 
/ S(l,4)-linked galactose; terminal GlcNAc monosaccharides were 
released with 40 m-unit-/xl _1 GUH {Streptococcus pneumoniae 
hexosaminidase), capable of cleaving /J-linked GlcNAc moieties; 
core a(l,6)-fucose was selectively removed using 1 m-unit/xl 1 
BKF (bovine kidney a-fucosidase), (2,3)-linked sialic acid 
was removed using 10 m-unit-jctl -1 recombinant Streptococcus 
pneumoniae NAN1 (neuraminidase/sialidase 1), and AMF 
(almond meal a-fucosidase) at 6 m-unit-^tl" 1 was used to release 
(1,3)- and (l,4)-linked non-reducing terminal fucose residues. 



Activity of EndoS 2 on free N-glycans 

N-glycans present on 80 /xg of bovine fetuin (Sigma-Aldrich) 
were released using 2500 units of PNGase F (New England 
BioLabs), and labelled with 2-AB (Ludger). The labelled fetuin 
2-AB glycan pool was then incubated at 37 °C for 16 h in 
the presence of 80yLig-ml _1 EndoS 2 in PBS to determine the 
activity of EndoS 2 on free N-glycans. The resulting EndoS 2 - 
digested glycan pool was then relabelled with 2-AB. Each 
glycan preparation was separated using a 1.7 yum BEH glycan 
column (2.1 mmxl50mm, Waters) and analysed by UHPLC- 
FLD-MS using a Waters ACQUITY UPLC® H-Class Bio with 
fluorescence detection coupled to a Waters Xevo G2-S Q-ToF 
mass spectrometer. The column temperature was 40 °C with a flow 
rate of 0.4 mlmin~ 1 using a linear gradient of 50 mM ammonium 
formate (pH4.4) against acetonitrile with ammonium formate 
increasing from 30 % to 47 % over a 32 min period. Fluorescence 
detection was achieved using excitation and emission wavelengths 
of 330 nm and 420 nm respectively. Eluting glycans were detected 
in positive mode with the following settings: cone voltage of 
80 V, capillary voltage of 3.0 kV, source temperature of 120 °C, 



desolvation temperature of 300 °C, and desolvation gas flow of 
800 l-h -1 . Mass data were acquired using sensitivity mode with 
a mass range of 750 m/z to 2000 mlz with a 1.0 s scan time. Both 
LC-FLD and LC-MS data were acquired and processed using 
Waters UNIFI version 1 .6. 



RESULTS 

Identification of EndoS 2 from GAS serotype M49 

In the sequenced genome of GAS strain NZ131 (serotype M49), 
we identified ndoS2, a gene harbouring a GH18 domain [6,24]. 
ndoS2 from GAS serotype M49 was found in the same genetic 
context as ndoS from GAS serotype Ml, but showed only 53 % 
nucleotide identity with ndoS (Figure 1). The surrounding genes, 
i.e. scrb, sera, scrk and pmi, showed a high degree of nucleotide 
identity when comparing the chromosomal context between strain 
NZ131 and serotype Ml strain MGAS5005 (Figure 1). One 
genome of serotype M49 is available to the public (NZ131, 
GenBank® accession number NC_011375) and therefore ndoS2 
was sequenced in five M49 strains of different origin and isolation 
year (3487-05, ACN49, AP49, AW1 and AW2). The comparison 
revealed 100% identity of ndoS2 in the five selected strains 
compared with ndoS2 found in NZ131. The ndoS2 sequences 
have been submitted to GenBank® (Supplementary Table SI). 
The deduced amino acid sequence of EndoS 2 and EndoS revealed 
37 % identity when aligned using ClustalW (Figure 2). The signal 
peptide was conserved, but three major sections of the EndoS 
amino acid sequence were lacking in EndoS 2 ; at positions 45- 
83, 535-561 and 933-986, gaps can be seen in the alignment. 
A comparison of the active site of EndoS 2 and EndoS revealed 
the GH18 motif (DXXDXDXE) with glutamate at position 186 
as the catalytic amino acid to be conserved (Figure 2). Specific 
tryptophan residues have previously been shown to be important 
for the enzymatic activity of EndoS, and when EndoS 2 was aligned 
and compared with EndoS, tryptophan residues at positions 121, 
164, 332, 361, 391, 809, 828 and 907 were found to be conserved 
[10-13]. 

In order to evaluate the evolutionary history of EndoS 2 , 
we reconstructed a protein specific phylogenetic tree, using 
BEAST version 1.7.4, on 101 protein sequences selected with 
the BLASTP algorithm on EndoS 2 . EndoS 2 (depicted in blue) 
was found to be unique to GAS serotype M49 and relatively 
different from EndoS found in other serotypes of GAS as 
well as EndoS-like proteins in other Streptococcus species 
(Figure 3). This can be contrasted with the taxonomic phylogeny 
of the 16S rRNA sequences (Supplementary Figure SI at 
http://www.biochemj.org/bj/455/bj4550107add.htm). 



EndoS 2 hydrolyses the N-linked glycan on the heavy chain of IgG 

Previous work has concluded that EndoS hydrolyses the N-linked 
glycan on IgG [10,13]. Although the enzymes are different, we 
tested whether IgG is a substrate for EndoS 2 . A comparison of 
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Figure 2 ClustalW alignment of EndoS 2 and EndoS 

EndoS 2 from GAS strain NZ131 and EndoS from GAS strain MGAS5005 was aligned using ClustalW. Depicted in blue is the GH18 active site (DXXDXDXE) and in green are conserved tryptophan 
residues. 



the hydrolysis of the N-linked glycan on the heavy chain of IgG 
was carried out using EndoS 2 , EndoS and PNGase F as positive 
control [14,30]. PNGase F from Elizabethkingia meningoseptica 
cleaves between the GlcNAc and the asparagine residue of N- 
linked glycans, whereas EndoS cleaves between the two GlcNAc 
moieties in the chitobiose core of N-linked glycans [10,11]. 
EndoS 2 was mutated in the active site through site-directed 
mutagenesis where the catalytically active glutamate residue was 
mutated to leucine, creating the enzyme EndoS 2 (E186L). EndoS 2 , 



EndoS 2 (E186L), EndoS and PNGase F were incubated with 
human IgG in PBS at 37 °C overnight, and analysed by SDS/PAGE 
and a subsequent LC A (recognizing a-linked mannose) lectin blot 
(Figure 4A). The gel shows a ~4 kDa shift of the heavy chain of 
IgG and a corresponding lack of LCA lectin signal when incubated 
with EndoS 2 , EndoS or PNGase F, but not with EndoS 2 (E186L) 
or PBS (Figure 4A). This result indicates that EndoS 2 hydrolyses 
the N-linked glycan on the heavy chain of IgG and confirms the 
glutamate residue at position 186 of EndoS 2 to be the catalytically 
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Figure 3 Phylogenetic reconstruction of the protein-specific tree for EndoS-like proteins, based on 1099 amino acids 

All internal nodes were supported by a Bayesian posterior probability (PP) of 0.99-1.0 except for nodes highlighted with grey squares (0.80<PP<0.95) or black squares (PP<0.80). The scale bar 
indicates genetic distance. Non-verified hypothetical proteins predicted from gene models are indicated with grey branches, the EndoS 2 clade is highlighted in blue and the fungal clade is highlighted 
in red. For accession numbers, see Supplementary Table S3 at http://www.biochemj.org/bj/455/bj4550107add.htm. 



active amino acid. To evaluate enzymatic activity of EndoS 2 on the 
subclasses of IgG, recombinant EndoS 2 and EndoS 2 (E186L) were 
incubated with human IgG subclasses 1^1 and showed activity 
on all four human subclasses as analysed by SDS/PAGE and 
LCA lectin blot (Figure 4B). The glycan-hydrolysing activity 
of EndoS 2 on animal IgG was found for the following species: 
mouse, rat, monkey, sheep, goat, cow and horse. To investigate 
glycan specificity of EndoS 2 , the composition of the released 
glycans from pooled human serum IgG was analysed by HILIC 
(hydrophilic interaction liquid chromatography )-UHPLC-FLD- 
MS and compared with the glycan profile of IgG generated by 
PNGase F (Figure 5). The HILIC-UHPLC-FLD-MS revealed 
EndoS 2 to cleave between the two GlcNAc residues in the 
chitobiose core of the N-linked glycan and thus leaving a single 



GlcNAc residue with or without a(l,6)-linked fucose attached 
to the protein backbone. All peaks present in the PNGase F 
chromatogram could be found in the glycan profile of IgG released 
by EndoS 2 with the difference of one GlcNAc with or without 
a(l,6)-linked fucose. 



EndoS 2 releases biantennary and sialylated glycans on AGP 

AGP, also known as orosomucoid, is a 41^-5 kDa human 
plasma glycoprotein, a major positive acute-phase protein, up- 
regulated severalfold during inflammation and a member of the 
lipocalin family [1 1,31]. The immunomodulatory effects of AGP 
is linked to the carbohydrate composition of the five N-linked 
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Figure 4 Activity of EndoS 2 on IgG, IgG subclasses and AGP 

(A) Human serum IgG was incubated with recombinant EndoS 2 , EndoS 2 (E186L), EndoS, PNGase 
F or PBS at 37 ° C in PBS overnight and analysed by SDS/PAGE (1 0 % gel) and a subsequent LCA 
blot. The gel and lectin blot shows the y heavy chain of IgG at 50 kDa. (B) Human subclasses of 
IgG, IgGM, were incubated with recombinant EndoS 2 , EndoS 2 (E186L) or PBS in PBS at 37°C 
overnight and analysed by SDS/PAGE (10% gel) and a subsequent LCA lectin blot. (C) AGP 
was incubated with recombinant EndoS 2 , EndoS or PBS at 37°C in PBS overnight and analysed 
by SDS/PAGE (10% gel). 



glycans (Asn 33 , Asn 56 , Asn 72 , Asn 93 and Asn 103 ) that make up 
45% of the molecular mass [15,31]- When incubating AGP 
with recombinant EndoS 2 , subsequent SDS/PAGE revealed a 
new band at ~38 kDa and a decrease in the intensity of the 
band at 45 kDa (Figure 4C). No activity was detected with 
EndoS or PBS in the same assay (Figure 4C). To elucidate the 
enzymatic activity of EndoS 2 on AGP in detail, we analysed 
the glycans released from AGP by EndoS 2 using HILIC-UHPLC- 
FLD-MS and exoglycosidase arrays in UHPLC (Figure 6). The 
sequence, composition and linkage specificities of all glycoforms 
of AGP released by PNGase F were determined in the same 
way to serve as control. EndoS 2 was found to cleave only 
biantennary and sialylated structures of AGP, whereas the glycan 
profile from PNGase F contained sialylated bi-, tri- and tetra- 
antennary structures with or without outer arm fucosylation. 
The cleavage site of EndoS 2 was confirmed to be between 
the two GlcNAcs in the chitobiose core of the glycan. The 
glycan profiles were digested with NAN1 to remove a(2,3)- 
linked sialic acids and ABS to remove a(2,3)-, a(2,6)- and 
a(2,8)-linked sialic acid residues. The resulting bi-, tri- and tetra- 
antennary structures from AGP were identified as M — 2H 2 ~ ions 
(m/z 879.3, 1061.9, 1135.5, 1244.5 and 1317.5). Furthermore, 
glycans were digested with linkage-specific exoglycosidases 
to verify the presence of outer arm fucosylation. These 
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Figure 5 Glycan fluorescent profiles from human IgG released by EndoS 2 
and PNGase F 

HILIC— FLD— MS of 2-AB-labelled glycans released from human serum IgG by EndoS 2 (A) and 
PNGase F (B) respectively. Identified glycan structures are presented using the Oxford glycan 
nomenclature [45]. 



enzymes were BTG, BKF and AMF (Supplementary Figure 
S2 at http://www.biochemj.org/bj/455/bj4550107add.htm). AMF 
digestion removed a(l,3) non-reducing terminal fucose linked to 
galactose residues and not core a(l,6)-fucose. BKF treatment, 
which is specific for core a(l,6)-linked fucose residues, did not 
result in glycan digest products. 



EndoS 2 is specific for IgG and AGP and not a general chitinase 

Previous work on EndoS has shown that the enzyme is specific 
for the native form of IgG [14]. To test whether this is valid for 
EndoS 2 , IgG and AGP were incubated at temperatures ranging 
from 37 to 80 °C or 37 to 70 °C for 30 min before the addition 
of EndoS 2 or PBS and a 2 h incubation at 37 °C. SDS/PAGE 
analysis revealed a shift of IgG incubated at 37-50 °C and 
loss of signal was seen in a corresponding LCA lectin blot, 
whereas only partial shift could be seen at 60 °C and no shift and 
intact LCA signal at temperatures 70 °C and 80 °C (Figure 7A). 
Glycans from AGP were hydrolysed at 37 °C, but not at 40-70 °C 
(Figure 7B). The activity of EndoS 2 was tested further on a range 
of glycoproteins, i.e. a 2 -macroglobulin, ovalbumin, lactoferrin, 
RNase B and fetuin, but no activity could be detected (Figure 7C). 
To study whether EndoS 2 shows general chitinase activity, we 
employed the substrate 4MU-GlcNAc, which fluoresces when 
cleaved, to compare the enzymatic activity of EndoS 2 and EndoS 
with that of a chitinase from S. griseus. The results indicate 
that neither EndoS 2 nor EndoS has a general chitinase activity 
compared with the positive control (Figure 7D). 



EndoS 2 hydrolyses free biantennary glycans 

Following the findings of the specificity of EndoS 2 , we asked 
the question whether the enzyme is substrate-specific and/or has 
glycoform selectivity. To test this, we analysed the activity of 
EndoS 2 on free glycans. All glycoforms from bovine fetuin were 
released using PNGase F and were 2-AB-labelled; in a secondary 
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Figure 6 Glycan fluorescent profiles from human AGP released by PNGase F and EndoS z 

HILIC— FLD— MS of 2-AB-labelled glycans released from human AGP by PNGase F (left) and EndoS 2 (right) respectively. 2-AB-labelled glycans were digested further with NAN1 and ABS, and 
subsequent bi- tri- and tetra-antennary structures are indicated using the Oxford glycan nomenclature [45]. Ions were detected as [M-2H] 2 ~ (**)and [M- H]~ (*) species. 




Figure 7 Activity of EndoS 2 on native and denatured IgG and AGP, other glycoproteins and chitinase assay 

(A) IgG was incubated at temperatures ranging from 37 to 80°C for 30 min followed by incubation with EndoS 2 at 37°C for 2 h and analysis by SDS/PAGE and LCA lectin blot. (B) AGP was 
incubated at 37-70°C followed by incubation with EndoS 2 at 37°C for 2 h and analysis by SDS/PAGE. (C) EndoS 2 was incubated with a 2 -macroglobulin, ovalbumin, human lactoferrin, RNase B 
and fetuin at 37°C overnight and analysed by SDS/PAGE. (D) EndoS 2 , EndoS and a chitinase from S. griseus was incubated with the fluorescent substrate 4MU-GlcNAc for 1 h and fluorescence 
was measured at 355/445 nm. The experiments were carried out using five replicates and results are means+S.D. The response in absorbance was analysed statistically by an unpaired Student's f 
test, where differences was considered significant if P<0.05. ****P< 0.001. 
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Figure 8 EndoS 2 hydrolyses biantennary free glycans 

Bovine fetuin N-glycans were released with PNGase F, labelled with 2-AB, and analysed by HILIC-UHPLC-FLD-MS. Released N-glycans were digested further with EndoS 2 to determine enzymatic 
activity on free glycans. Comparison of the fluorescent chromatograms of glycans after PNGase F (B) and subsequent EndoS 2 digestion (A) identified three unique peaks (labelled 1, 2 and 3). 
These peaks correspond to three isomeric structures (A2G2, A2G2S1 and A2G2S2) and were detected primarily as m/z 1558.55 [M + H] + , 925.33 [M + 2H] 2 + and 1070.87 [M + 2HP+ ions 
respectively (CHE). Extracted ion chromatograms of A2G2S1 and A2G2S2 precursor ions identified structural isomers, presumably from variation in sialic acid linkages. 



reaction, the free glycans were incubated with EndoS 2 , relabelled 
with 2-AB and analysed using HILIC-UHPLC-FLD-MS. Three 
structures in the chromatogram (labelled 1 , 2 and 3) were modified 
by EndoS 2 compared with the PNGase F glycan pool (Figure 8A 
and 8B). The m/z of the [M + H] + ions of these structures were 
identified and revealed structures for A2G2 (1558.5513), A2G2S1 
(925.3294) and A2G2S2 (1070.8700) less one GlcNAc residue 
(Figures 8C-8E). The results indicate that EndoS 2 specifically 
hydrolyses free biantennary glycoforms with or without terminal 
sialylation. 



Expression of EndoS 2 is linked to carbohydrate utilization 

To confirm the findings with recombinant EndoS 2 , the expression 
levels and enzymatic activity of EndoS 2 were analysed in GAS 
supernatants. The expression of EndoS in serotype Ml of GAS 
is maximized in the nutrient-poor CM [10,16-21]. Therefore 
expression of EndoS 2 was analysed by Western blotting of 16 h 
bacterial supernatants grown in CM. However, EndoS 2 could only 
be detected in the supernatant when GAS was cultured in 50 % 
diluted CM, when the bacteria are starved (Figure 9A). In a 
subsequent functional assay incubating IgG with the bacterial 
supernatant, loss of the N-linked glycan on IgG was visualized 
as a 4 kDa shift of the heavy chain on the electrophoresis gel and 
corresponding lack of signal in the LCA lectin blot (Figure 9B). 
This experiment confirmed the activity of the native protein in the 
bacterial supernatant. The expression of EndoS 2 in poor medium 
led us to believe that the expression of EndoS 2 was linked to the 
carbohydrate utilization of the bacteria [22,23,32]. To address 
this, a selection of carbohydrates was added to the bacterial 
culture medium and the expression of EndoS 2 was studied. Adding 
glucose, galactose, GlcNAc or mannose to 50% CM inhibited 



EndoS 2 expression, whereas additional sucrose increased the 
amount of EndoS 2 in the supernatant (Figure 9A). 



DISCUSSION 

The study of bacterial glycosidases has emerged as a field at 
the intersection of microbial pathogenesis and glycobiology. By 
studying the mechanisms by which bacteria interfere with host 
glycosylation, new insight can be gained into both bacterial 
pathogenesis and the impact of glycosylation of the immune 
system. Interfering with the glycosylation of the host defence 
is widespread among pathogenic bacteria for modulation of the 
functions of the immune system or as a way of utilizing the glycans 
of glycoproteins as nutrients [24,33]. 

For example, Enterococcus faecalis, a Gram-positive gut 
bacterium and opportunist, secretes EndoE, an endoglycosidase 
with activity on the Fc-glycan on IgG and on the glycoprotein 
RNase B that promotes bacterial growth when nutrients are scarce 
[24,34]. The endoglycosidases EndoF^ fromis. meningoseptica 
and EndoH from Streptomyces plicatus has been shown to be 
glycan-specific: high-mannose and hybrid oligosaccharides are 
cleaved by EndoF! and EndoH, whereas complex biantennary 
and bi- and tri-antennary glycans are released by EndoF 2 and 
EndoF 3 respectively [24,25,35-38]. An N-glycan deglycosylation 
complex in Capnocytophaga canimorsus has been found to cleave 
off N-linked glycans from IgG and to transport the glycans 
across the cell membrane for glycan catabolism [24,25,39]. S. 
pneumoniae has three surface-anchored exoglycosidases that 
work in concert to remove sialic acid, galactose and GlcNAc 
on human glycoproteins [40]. GAS EndoS was thought to 
be conserved throughout the GAS serotypes, and only minor 
variations are found when comparing ndoS among the sequenced 
GAS strains. It was therefore surprising to find that GAS strain 
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Figure 9 EndoS 2 expression and activity in GAS strain NZ131 

(A) Western immunoblot showing EndoS 2 in bacterial supernalants with or without added 
carbohydrates. rEndoS 2 was used as a positive control. (B) Lectin blot analysis of human IgG 
incubated with bacterial supernatants with or without carbohydrates, or rEndoS 2 as a positive 
control. 



NZ131 harboured ndoS2, with 53% identity with ndoS. The 
sequenced ndoS2 in five different M49 strains revealed high 
identity, arguing that this gene is conserved throughout the 
serotype. 

In the phylogenetic protein tree, the EndoS 2 group is relatively 
different from EndoS in both S. pyogenes and in other 
Streptococcus species (but it groups with a hypothetical protein 
found in Streptococcus ictaluri). In general, the patterns are not 
uniform: whereas within-species or within-genus similarity of 
the EndoS-like proteins is high for some taxonomic groups (e.g. 
Bifidobacterium longum), there is also considerable within-genus 
variation in Corynebacterium (Figure 3). Strikingly, EndoS and 
EndoS 2 from Streptococcus are more closely related to EndoS-like 
proteins of the fungi Cordyceps militaris and Beauveria bassiana 
than to EndoS-like proteins of bacteria such as Melissococcus, 
Corynebacterium and Lactobacillus (Figure 3), in sharp contrast 
with the taxonomic relationships (Supplementary Figure SI). 
Notably, some taxa are paraphyletic in the EndoS-like protein 
phylogeny, to which could possibly be ascribed the inclusion 
of non-verified hypothetical proteins. However, the biologically 
verified EndoS-like proteins of Enterococcus gallinarum and E. 
faecalis do not form a monophyletic clade (Figure 3). In all, this 
picture indicates the occurrence of horizontal gene transfer of 
ndoS-like genes. Even though no known proteins were found to 
be closely related to EndoS 2 , the differentiation from S. pyogenes 
EndoS and the high degree of similarity between serotype M49 
and other S. pyogenes strains combined with the conserved genetic 
context points towards horizontal gene transfer of ndoS2 into 
serotype M49. The strain NZ131 also has an unusually high 
frequency of transformation, and horizontal gene transfer has been 
described on several places in the genome [24]. The alternative 
interpretation, that a particularly strong directional selection on 
the ancestral ndoS gene in serotype M49 resulted in ndoS2, seems 
less plausible. 

The active site and tryptophan residues important for activity 
in EndoS were found to be conserved in EndoS 2 even though 
the proteins are only 37 % identical [13]. Despite this substantial 
difference in amino acid sequences, EndoS 2 hydrolysed the glycan 
on IgG in a similar fashion to EndoS. In the chromatograms 
comparing the glycan profile of IgG generated by EndoS 2 and 
PNGase F (Figure 5), a shift was observed that could be explained 
by the site of action. PNGase F is an amidase that cleaves between 
the asparagine residue and the first GlcNAc residue of the glycan, 
whereas EndoS 2 cleaves after the first GlcNAc and thus leaves one 
GlcNAc with or without fucose attached to the protein backbone. 



Owing to lack of one reducing end GlcNAc in the EndoS 2 glycan 
profile, there is a loss in resolution, which explains why the A2G1 
peak could not be separated in the EndoS 2 chromatogram, but can 
be seen as two separate peaks in the PNGase F profile. It has been 
argued previously that EndoS does not cleave bisecting glycans 
[41,42]. From the LC-MS data of the present study, we argue 
that EndoS 2 cleaves all glycoforms present of human serum IgG, 
including bisecting glycans, since all peaks present in the PNGase 
F glycan profile could be found in the EndoS 2 profile (Figure 5). 

A striking difference between EndoS 2 and EndoS was found 
when incubated with the human acute-phase protein AGP. 
The observed activity of EndoS 2 was confirmed with LC- 
MS and revealed that EndoS 2 specifically releases biantennary 
and sialylated structures of AGP (Figure 6). Again, the peaks 
annotated in the EndoS 2 profile could be found in the PNGase F 
release with the difference of one GlcNAc residue. It is clear that 
EndoS 2 only releases a fraction of the glycans present on AGP. 
EndoS 2 does not cleave tri- and tetra-antennary glycans, with or 
without outer arm fucosylation, although they are present in great 
numbers on AGP. 

The activity on IgG and AGP raised several questions regarding 
the specificity of EndoS 2 . To answer these, we tested the activity of 
EndoS 2 on heat-denatured IgG and AGP, on other glycoproteins, 
in a chitinase assay and on a pool of free N-glycans. EndoS 2 
was only active on native IgG and AGP and we draw the 
conclusion that EndoS 2 requires a protein-protein interaction 
with its substrates for glycan hydrolysis to occur. The activity of 
EndoS 2 on AGP may be the result of reduced protein recognition, 
since early studies indicate sequence homology between IgG 
and AGP [43]. On glycoproteins with a completely different 
fold, we detected no activity with similar assays to the activity 
on IgG and AGP detected. It was therefore not surprising to 
find that EndoS 2 had no general chitinase activity compared 
with a chitinase from S. griseus. Taken together, these data 
indicate that EndoS 2 specifically interacts with protein folds 
including IgG and AGP. Furthermore, we dissected the glycoform 
specificity of EndoS 2 by incubating the enzyme with the N- 
glycan pool from fetuin released by PNGase F and showed 
that EndoS 2 hydrolysed only free biantennary structures with or 
without terminal sialylation. No bisecting glycans are present on 
fetuin, which explains why such structures are not present in the 
chromatograms. On the basis of our findings, we believe that 
EndoS 2 is both site- and glycoform-specific which is a unique 
property of an endoglycosidase. 

The hydrolysis of the glycan of IgG has been shown to have 
major consequences on the effector functions of the antibody by 
modulating the binding to FcyR [12]. Since EndoS and EndoS 2 
have similar hydrolysing activity on the glycan of IgG, both 
enzymes are expected to affect the functionality of this antibody. 
The functional consequence for AGP when biantennary sialylated 
glycans are cleaved off is unknown and lies beyond the scope of 
the present study. 

The expression of EndoS 2 was found to depend on the 
availability of carbohydrates in the bacterial culture medium. C- 
medium is a poor medium for GAS and expression of EndoS 2 
could only be detected when GAS was grown in 50% diluted 
C-medium. Incubating the supernatants with IgG confirmed the 
previous work carried out with recombinant EndoS 2 and a clear 
correlation between expression of EndoS 2 and hydrolysis of the 
Fc-glycan on IgG confirmed this. The genes scrb, sera and scrk, 
surrounding ndoS2, are part of a sucrose utilization operon and this 
could explain the increase of EndoS 2 expression when sucrose was 
added to the culture medium. The presence of glucose, galactose, 
GlcNAc or mannose completely inhibited expression of EndoS 2 
indicating that this enzyme is tightly regulated by a mechanism 
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sensitive to the presence of carbohydrates. Research has indicated 
that the virulence of GAS is linked to the utilization of available 
carbohydrates via CcpA (catabolite control protein A), but in the 
present study of EndoS 2 , we can only hypothesize that CcpA is 
involved in the regulation mechanism [32]. This indicates that in 
the infection scenario, EndoS 2 is strictly regulated and that the 
enzyme is used in an environment where nutrition is scarce, e.g. 
the human skin. This indicates further that the virulence of GAS 
is linked to the utilization of complex carbohydrates [44]. 

The present study shows that the endoglycosidase EndoS 2 is 
conserved and uniquely present in GAS serotype M49. We show 
that EndoS 2 hydrolysed all glycoforms on human serum IgG and 
biantennary and sialylated glycans on AGP. EndoS 2 is secreted 
by GAS during starvation and the expression is linked to the 
carbohydrate composition of the culture medium. The enzymatic 
activity on two key players of the immune system argues that 
EndoS 2 has a role in immunomodulation of the host that could 
potentially be linked to the pathogenesis of GAS serotype M49 
infections. 
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Cordyceps miliiaris CM01 
Bifidobacterium longum (2) 
Cory nebacten urn diphlenae (multiple) 
Corynebacterium ulcerans (2) 
Corynebaclerium pseudotuberculosis (multiple) 
Corynebacterium pseudotuberculosis (2) 
Brachybacterium faecium DSM 4810 
Helcococcus ttunzii 22 
Helcococcus kunzii M 1 1 2 
Paentbacillus popdliae ATCC 14706 
Paembacillus alvei DSM 29 
Listeriaceae bacterium TTU M 1-001 
Enlerococcus faecalis (multiple) 
Melissococcus plutomus (2) 
Camobactenum maltaromattcum ATCC 35586 
Streptococcus equi ssp. zooepidemicus MGCS10565 
Sireplococcus equi ssp. zooepidemicus ATCC 35246 
Sireplococcus equi ssp equi 4047 
Sireplococcus pyogenes MGAS15252 
Slreptococcus pyogenes MGAS8232 
Streptococcus pyogenes Alab49 
Streptococcus pyogenes MGAS10394 
Streptococcus pyogenes str. Manfredo 
Sireplococcus pyogenes MGAS1075O 
Sireplococcus pyogenes M1 GAS 
Sireplococcus pyogenes NZ131 
Streptococcus pyogenes MGAS9429 
Streptococcus pyogenes MGAS5005 
Streptococcus pyogenes MGAS10270 
Streptococcus pyogenes MGAS6180 
Sireplococcus pyogenes MGAS315 
Sireplococcus ictaluri 706-05 
Lactococcus garvieae ATCC 49156 
Elizabethkingia anophelis Agi 
Elizabethkingia memngoseptica ATCC 13254 

Figure S1 Phylogenetic reconstruction of 16S rRNA sequences, for 
inference of taxonomic relationship between taxa of the EndoS-like protein 
phylogeny, based on 1576 nucleotides 

All internal nodes were supported by a Bayesian posterior probability (PP) of 0.99-1.0 except 
for the grouping of the Listeriaceae family bacterium, highlighted with a black squares signifying 
PP<0.80. The scale bar indicates genetic distance. Note that the branches connecting the 
ingroup and outgroup are truncated owing to the large taxonomic difference (genetic distances 
stated below branches). For accession numbers, see Table S3. 
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Figure S2 HILIC-FLD-MS and exoglycosidase sequencing of AGP N-glycans 

The fucose linkage position was confirmed as «(1 ,3) to galactose residues on tri- and tetra-antennary glycans and were detected as [M — 2H] 2 ~ ions 972.9 and 1074.5. 



Table S1 E. coli and group A streptococcal strains used in the present study 

Strains of E coli and S. pyogenes with accession numbers. Also indicated is the GenBank® accession number for ndoS2 sequences. N/A, not applicable. 



Strain 



£ coli 
Top10 
BL21 pLysS 
GAS (serotype) 
NZ131 (M49) 
3487-05 (M49) 
ACN49 (M49) 
AP49 (M49) 
AW1 (M49) 
AW2 (M49) 



Source/accession number 



ndoS2 accession number 



Invitrogen N/A 

Invitrogen N/A 

ASM1812v1 ACI61688 

S. pyogenes, serotype M49 KC1 55346 

Strain 3274-98 from Center for Disease Control and prevention. A gift from Actinova KC155347 

S. pyogenes strain collection, Institute of Hygiene and Epidemiology, Prague, Czech Republic KC155348 

Nephritis strain H 9449 B from the collection of L. Wannamaker KC1 55349 

Strain H 5424 from the collection of L. Wannamaker KC155350 
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EndoS 2 hydrolyses N-linked glycans on IgG and ai-acid glycoprotein 



Table S2 Plasmids and primers used in the present study 



Plasmids and primers used for recombinant expression and sequencing, 
(a) Plasmids 



NsmG 


^ni irrp 


pb EX-5X-3 


GE Healthcare 


pb EX-ndob2 


The present study 


pGEX-ndoS2(E-L) 


The present study 


pGEX-ndoS 


[1] 


pCR2.1 


Invitrogen 


(b) Primers 




Name 


Sequence (5 — >o ) 


ndoS2-out-R 


GCGCCACTTTCTGGTAGTCTAAC 


Seq38-R 


TTTGGGACCTCAGCCATAG 


Seq42-R 


GGATTGTTAGGGTCGTATTCG 


Seq54-R 


TTTAGCCCCTTTTGCCAG 


Seq15-F 


CCTGAAGAAGATGCTCGTG 


Seq17-F 


TGATGACCGAAGACAAACG 


Seq24-F 


GAGACGAAGAAGCATTTGC 


Seq28-F 


TCACTTGGGGACAAACTAAC 


ndoS2-F-BamHI 


CTGTAAGGATCCAGGAGAAGACTG 


ndoS2-R-Xbol 


GAAACCTCGAGTCTTTGTAATCGTAGGACTT 


ndoS2(E-L)-F-BamHI 


CTAGATATTGATATTCTTCACGAATTTACGAAC 


ndoS2(E-L)-R-Xhol 


GTTCGTAAATTCGTGAAGAATATCAATATCTAG 
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J. Sjogren and others 



Table S3 Accession numbers for EndoS-like protein and 16S rRNA sequences 



For the EndoS-like protein dataset, outgroup sequences are indicated with an asterisk. For the EndoS 2 sequences of the present study, see Table S1 . 



Taxa 


Strain 


GenBank® accession number 
EndoS-like protein 


16S rRNA 


ActinoiTiycBS nsuu 


bvoUzyAo 


LY _ \ IIWUjoo 




DcaUVSlla UaSSldlld 


arccc open 


C IPR79/0 




DliluuudUBilUlil lUilyUni SUOSp. lUiailllb 


10/ r 


VP nn^90Q99R 
I r^U^tiUy^^o 


APnmfiQn 
rtrU iuoyu 


DtllUUUdUBllUlll luliyUtll SUOSp. Illldllllb 


ATPP mfiQ7 

tti oi< iooy/ 


vp nn9^93Qnn 


ArU lUooy 


DliluuudUdilUiil luiiyutu suosp. lUllyUili 


AA R 
44 D 


CI IQ9199 




DlllUUUdLiGltUIII lUilyUHl buUb\). luiiyUHl 


l-DD 


CI 1 9m RK 
tlJZo lou 




DldLilyudLlclluliI IdcLlUlil 


UOlvl 40 I U 


vp nn^mRQ77' vp nn^mt;^7i 
i r_uuo i uoy / / , ir_uuoiooo/i 


Ay i \joz 


udlllUUdblul lulll IlldlldlUlltdUbulll 


ATPP "WRfi 


7P m97Q9fift 


IF74Q9RQ 
Or/ 49^09 


Caffiobactofium maltafomaticum 


I MA9R 
LIVIAZo 


r r_uuoyyooyy 




ulll ybnUUdUnl lul 1 1 IllnlllliyUbupilbulll 


ATPP 139^4 
Al UL/ \Oc.oH 


A \lf\A^A^ 
r\i) 1 \JHDH 1 




iiOfUybdpS lillllldllb 


pym 


CP,YQQ7R7 
CuAoy/ D( 


nUJuoU/ J 


UUIylltlUdLtcllUIII Uipillllul Ida 


iLH I 


vp nnm9R3^n 
T r_uuu i duoou 


ppnn39n7 

UrUUo^U/ 


fnri/nahortarium rlinhihorioo 
uUiyilcUdLlcllUIII Uipillllul Ida 


Dno 


I r_UUJ 1 0 1 C-Ho 


ppnn39na 

LirUUo^U9 


irUiyilcUdLlclluiIi uipilUldfldc 


P7 { G\ 


vp nnc;iRRfin7 
rr_uun iDouU/ 


ppnOQ9in 
L-rUuoZ IU 


l/UiyilcUdbtcllulll UljJIIlllul Ida 


pnpF R^Q9 


vp nnm^fiio, 

T i_UUu \ oH\J 1 9 


ppnn39ii 

UrUUo^ I I 


fnri/nahortarium rlinhihorioo 
uUiyilGUdblCllUIII Uipillllul Ida 


IMPA AfY) 


VP 011^19^^30 
i r_uuu i tioOoa 


ppnn39DR 


Cnri/nahoriarium rlinhihorioo 
uUiyilcudbtcllUIII Uipillllal Idc 


r VVu 


vp nnm^fiQQ 

T r_UUu Itouaa 


PPnflR91R 


C* nri/nahor*iarit im rlinhihorioo 
UUiyilcUdLlCllUlIi UipilUldlldC 


^1 A 
0 I rt 


r r_UUu 1 Dooyy 




C 1 nr\tnnhor*tarii im rlinhihorioo hw iniarmorlii to c-ir 
l/Ui yiluUdLlui lull 1 Uipillllclldu UV. lillciltluUIUS Ml. 


mptp 1 


clrxOOo^io 




C 1 nrvinohortorit im ncoi irlnii ihoroi ilnoic 
uUI yi InUdblui lul 1 1 pbnUUUluUul bUIUblb 


o I 


VP 00^91437^ 
T r_uuOd i Ho 1 □ 


ppnnR49i 


C 'nri/naho/^tarii im ncoi irlnii ihoroi ilneie 
l/UI yi IcudLtcl lul 1 1 pbnUUUluUal bUIUblb 




T r_UUOo JO4D0 


UrUUoo4U 


CofynBbactBfium psBudotubofcuiosis 


Pnl R9 


vp nnfi/iQ7Q7f: 


PP00QR^9 


CofynBbactBfium pscudotubofculosis 


rnU4 1 


VP nOQ7Q^90C 


PPH09007 


CofynBbactBfium pscudotubofcuiosis 


pat 'i n 
ml 1 U 


r r_UUOuy loo 1 


PP009Q9/1 

UrUU£y^4 


CofynBbactBfium pscudotubofculosis 


VQ-| 007 






CofynBbactBfium pscudotubofculosis 


o Id 


vp nn^QO/im a 
r r_UU0oU40 \h 


PP00Q077 

UrUUoU/ / 


CofynBbactBfium pscudotubBfcuiosis 


PIP 07 

Ulr DZ.tif 


VP r\C\RRQRAAR 

r r_UUjuyo44o 




CofynBbactBfiuffi uiccfans 


ouy 


r r_UUo/ 1 loyo 


rpno97on 


CofynBbactBfium ulcBfans 


DP An99 


VP 00/1KQ07/1Q 

Yr^U^uoU/^y 


PP0097Q-1 


ElizabBthkingia anopholis 


Agl 


7P OO/HEQQQ 

Zr_Uy4 luooo 


CC/19C/19C 


ElizabBthkingia msningosBptica 


CD A9 


ci A^/lc 
rLAIVIL 




EntBfococcus faBcalis 


ATPP 90900 

ai oi/ ^y^uu 


7P 0/I/1QQQ99 

Zr_U44oyoZZ 


Al/hMJ 1 UUUUo 1 


EntBfococcus faBcalis 


PP1 PC 

Ub I nr 


PP0n9R9"l 




EntBfococcus faBcalis 


VOoo 


MP Q1QQ17 

INr_o loa I / 




EntBfococcus faBcalis 


ucp-1 n/i/i 
ntn lU^^ 


A AP90/177 




EntBfococcus faBcalis 


Uoo 


7P 0CCC10Q9 

Zr_UDOD IUoZ 




EntBfococcus faBcalis 


Lnvoo 


C IWQfi£Q7 
LJVODOO/ 




EntBfococcus faBcalis 


ppi -i 
r\j 1. 1 


7P 0R7/1RD7Q 

Zr_Uu/ 400/0 




EntBfococcus faBcalis 


noUo 


C IWQ0/I90 

LJVoy4^U 




EntBfococcus faBcalis 


D7-1 9 
n/ 1(1 


7P 0RR9n79R 

Zr_UuuoU/Zu 




EntBfococcus faBcalis 


T1 
I I 


7P 0^/l999QQ 

Zr_Uo4^^Zoy 




EntBfococcus faBcalis 


T9 


7P 0^/19^90/1 




EntBfococcus faBcalis 


TQ 


7P 0^C^n9^QQ 

Zr_UOOU^Ooy 




EntBfococcus faBcalis 


TQ 

lo 


7P r\RR£f\AA7 




EntBfococcus faBcalis 


Tvnoi 9 
1 AUU 1 Z 


CCT0K9C0- CCT0CC1C' CCT0CC90 

Lr I yjtibU, Lr I yoo lb, Lr I yoo^u 




EntBfococcus faBcalis 


Tvm o/i 
I AU I U4 


7P OQQC^nOO'l 

Zr_UoyoUUU I 




EntBfococcus faBcalis 


Tvm oo 
I AU iuy 


7P 07^R71/I0 

Zr_U/00/ I4U 




EntBfococcus faBcalis 


TVOQOOP 

I AUdUyb 


CCI IQCQ9Q 




EntBfococcus faBcalis 


TVO/170 

1 AU4/U 


7P 077R0/1R/I 




EntBfococcus faBcalis 


1 AUoDD 


7P (MRRRIOR 
Zr_U/000oZu 




EntBfococcus fascalis 


1 AUouU 


7P 07££QQ1 A 

Zr_U/Ojyo 14 




EntBfococcus faBcalis 


1 A 1 6ZZ 


7P 0/MQ£7/1Q 

Zr_U44oj/4o 




EntBfococcus faBcalis 


I A I oH I 


CCI M 9-1 Q7 

trU IZ lo/ 




EntBfococcus fascalis 


TV-1 Q/19 
1 A 1 o4Z 


CCI M £QQR 

tru looyo 




EntBfococcus faBcalis 


TV-1 Q/IR 
1 A 1 OHU 


CCI M RR9H 

trU 1 DuZU 




EntBfococcus faBcalis 


TY1/1R7 
1 A 1 HV 1 


CPPRQQ1 Q 

tbbooy I o 




Fntprnrnrnis faprpilis 

t-ltlUI UUUUUUO ICtGliQItO 


7X2137 


FFT^8fiS4 

1_ I I JUUJl 




EntBfococcus fascalis 


TX4244 


EFT91060 




EntBfococcus gallinamm 


EG2 


ZP 05648160 




EntBfococcus sacchamlyticus 


30_1 


ZP_091 10633 




HbIcococcus kunzii 


ATCC 51366 


ZP 09737336; ZP 09738693 




HbIcococcus kunzii 


22 


NR 029237 




HbIcococcus kunzii 


M112 


JN861738 




Lactobacillus zeas 


KCTC 3804 


ZP 09453558 




Lactococcus gafvieae 


ATCC 49156 


YP 004779109 


AP009332 


Listeriaceae family bacterium 


TTU M1-001 


ZP 09890591 


JQ287762 


MBllssococcus plutonlus 


ATCC 35311 


YP 004455741 


AP012200 
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Table S3 Continued 



Taxa 


Strain 


GenBank® accession number 
EndoS-like protein 


16S rRNA 


IViullbbUbUbbUb UlUlUiltUb 


UAI 00 1 


T r_UUOo£U 1 ou 


APD1 99R9 


raulllUablllUb alVcl 


UOIVI L.O 


7P 10RRRR97 
LY _ 1 UOOOD£/ 


A 1390401 
AJO£U4y I 


Pi An ihlnl 1 It ic rinnrlriHfn rmic 

raclllUablllub UcilUIIUIUtllllb 


0404 


7P nQR7R19fl 




Poonihopilli ic nnnillioo 
raulllUablllUb jJUpilllan 


ATPP 147f)R 
Al OL/ I4f UO 


PAP 'ALASKA 
\Xt\\j"s 1 404 


AR07R1QR 

adui o i yo 


Oil UjJlUbUbbUb ualllb 


F^l 73-997 

lOL LO ill. 1 


7P 10974^31 

L\ I \Jc. 1 4 JO 1 




StfBptococcus bqui subsp. BQUI 


A0A7 

4U4/ 


vp oo974^Rm 

I r_UU£f 400 10 


rivldU4o0o 


OtlcpiUbUbbUb uljUl bUUb\). lUUcylUulillbUb 


IvIuLjO I uooo 


VP 0091997^3 
I r_UU£ I Lei Do 


PPflfH 19Q 
UrUU I I ^y 


OtlcpiUbUbbUb uljUl bUUb\). lUUcylUulillbUb 


ATPP "WdR 
Al L/L/ 00*140 


AdJ£4000 


ppnn9Q04 
uruUiiyu4 


OtlcpiUbUbbUb uljUl bUUb\j. LUUcylUCllllbUb 


H7n 
n/ u 


VP 00974^197 
I r_uu£f 40 i ti/ 




Otl CjJIUbUbbUb IblalUI 1 


707-05 


7P nQ19711fv7P DQ1 ^RQTn* 
z.r uy i £ / i i u, £r uy i ^uy / u 




OliepiUbUbbUb tbldtuil 


/ uo-uo 






OtlcpiUbUbbUb llllac 


y 1 1 / 


7P 1inR7Qd^ 
z.r_ i i uo/ y4o 




OtlcpiUbUbbUb pyuycitcb 


AldU4y 


VP nnfifl79fifi9 
i r_uuou/ cuDl. 


PPflfl^ORR 

uruuouoo 


OtlcpiUbUbbUb pyUycllcb 


IV I I uAO 


MP 9RQR1R 
Nr_^oyo i o 


AFOORfim 
ACUUOO I 0 


OtlcpiUbUbbUb pyUycllcb 


IVIuAOO 1 J 


MP RR^^79 


AF014fl74 
ACU I4U/ 4 


Otl CjJIUbUbbUb jjyuycllcb 


IVIUAOJUUJ 


YP 9R9QfH 


ppnnnni7 

uruuuu i / 


OtlcpiUbUbbUb pyUycllcb 


MPA^RIRO 
IVIuAOO 1 OU 


YP 9ftnQQ9 
i r_dOuyyii 


ppnnno^R 

uruuuuoo 


OtlcpiUbUbbUb pyUycllcb 


MPA^R939 

IVluAo0£O£ 


MP RD7RRR 
INr_0U/ 000 


AFOOQQ^d 
Acuuyyo4 


OtlcpiUbUbbUb jjyuynllnb 


ly/ir; ACQ/1 on 
iviuAoy4£y 


YP ^0797^ 
I r_oy / Li o 


PPflfin9 c lQ 
uruuudoy 


OtlcjJlUbUbbUb jjyUycllcb 


MPA^m970 
IVIuAu I UCi U 


YP fiQcmR 
i r_oyy^ i o 


ppnnn9Rn 
uruuudou 


on ujJiUbUbbUb yyuyciicb 


IVIUAO i uoyt 


YP flRDRdR 
i r_uuuo40 


ppnnnnn^ 

\j\ uuuuuo 


tyrpnfnrnrn i° nvnnpnp 0 . 


MGAS1 07^0 


YP finsoq^ 


ppnnn?R? 


Streptococcus pyogenes 


MGAS15252 


YP 005389405 


CP003116 


Streptococcus pyogenes 


str. Manfredo 


YP_001 127900 


AM295007 


Streptococcus pyogenes 


NZ131 


YP 002286383 


CP000829 


Streptococcus pyogenes 


ATCC 10782 


ZP 07460044 




Streptococcus pyogenes 


HKU QMH11M0907901 


EIK41346 




Streptococcus pyogenes 


M49 591 


ZP 00365754 




Streptococcus pyogenes 


CS101 


U78969J* 




Streptococcus sp. group C 


C116 


ADC53484 




Jrichosporon asahii var. asahii 


CBS 2479 


EJT46459 
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